1 

Cyclic-Coded Integer-Forcing Equalization 

Or Ordentlich and Uri Erez, Member, IEEE 



Abstract — A discrete-time intersymbol interference channel 
with additive Gaussian noise is considered, where only the 
receiver has knowledge of the channel impulse response. An 
approach for combining decision-feedback equalization with 
channel coding is proposed, where decoding precedes the removal 
of intersymbol interference. This is accomplished by combining 
the recently proposed integer-forcing equalization approach with 
cyclic block codes. The channel impulse response is linearly 
equalized to an integer-valued response. This is then utilized by 
leveraging the property that a cyclic code is closed under (cyclic) 
integer-valued convolution. Explicit bounds on the performance 
of the proposed scheme are also derived. 

Index Terms — Linear Gaussian channels, single-carrier mod- 
ulation, intersymbol interference, combined equalization and 
coding, cyclic codes, decision-feedback equalization. 

I. Introduction 

The intersymbol interference (ISI) channel with additive 
Gaussian noise is one of the most basic channel models 
arising in digital communications. Thus, considerable effort 
has been devoted to developing effective transmission schemes 
for this channel; see, e.g., jjj, for a comprehensive survey. The 
channel is described by 

yfe = a;fe + ^ h„iXk-rn + rik 

= Xfc + ISIfe + nfe, (1) 

where ISIfc is the ISI resulting from other data symbols, and 
Tik is additive white Gaussian noise (AWGN) with zero mean 
and unit power 

The channel model may further be characterized by the 
availability of channel state information (CSI), where we 
distinguish between the case where CSI is available to the 
transmitter and the receiver alike and the case where CSI is 
available to the receiver only. As we next briefly recall, while 
the distinction between these two cases does not make a great 
difference at high signal-to-noise ratios (SNR) (which is the 
main focus of this paper) in terms of capacity f2] (i.e., whether 
water-filling may be performed or not), it is of significant 
consequence for the design and implementation of equaUzation 
and coding schemes. 

In the past decades coding for AWGN channels has reached 
an advanced state, and practical coding schemes (e.g., turbo 
and LDPC codes) operating near capacity are known. It 
is thus desirable to combine AWGN coding and decoding 
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techniques with equalization in a modular way, with the aim 
of approaching the capacity of the ISI channel. 

The multitude of approaches developed to achieve reliable 
communication over the ISI channel may be roughly divided 
into two classes: multi-carrier approaches and single-carrier 
approaches. Both approaches may in principle be used to 
approach the capacity of the ISI channel, but offer different 
practical tradeoffs as we briefly touch upon next. 

In multi-carrier transmission, the ISI channel is transformed 
into a set of parallel AWGN subchannels, each subchannel 
corresponding to a different frequency bin and experiencing 
a different SNR. This approach has the advantage that the 
subchannels are (virtually) ISI free, and thus the problems 
of equalization and decoding are decoupled. However, it has 
some drawbacks: the alphabet size of the transmitted symbols 
is considerably enlarged, which in turn makes the approach 
inapplicable to some media, such as magnetic recording 
channels. A related phenomena associated with multi-carrier 
transmission is that it results in a high peak-to-average power 
ratio which may also be undesirable (see e.g., jSj, H). 
Furthermore, when CSI is available only at the receiver, bit 
allocation is precluded, and channel coding and decoding 
become more difficult, due to the variation of the SNR across 
subchannels. 

Single-carrier approaches try to eliminate most of the 
ISI without severely increasing noise power. The simplest 
approach is linear equalization consisting only of a "feed- 
forward" equalizer (FFE), which roughly transforms the chan- 
nel into an additive colored Gaussian noise channel, where the 
minimum mean-squared error (MMSE) criterion corresponds 
to (linearly) maximizing the signal-to-interference-plus-noise 
(SINR) at the "sheer". Performance may be improved us- 
ing (non-linear) decision-feedback equalization, in addition 
to FFE. Specifically, MMSE decision-feedback equalization 
(DEE) will be discussed in greater detail in the next section. 
In fact, as shown by Guess and Varanasi Q, the MMSE- 
DFE architecture is optimal in the sense of attaining mutual 
information, and allows to approach capacity with AWGN 
encoding/decoding, if decisions (fed to the DFE) are based 
on codewords rather than symbols. See also |6|, |7|. 

Unfortunately, the Guess-Varanasi approach, while quite 
pleasing from a theoretical perspective, requires long inter- 
leaving as well as long zero-padding, which in turn requires 
long latency. This drawback can be avoided if CSI is available 
at the transmitter by Tomlinson-Harashima precoding ( 121, 
|8J), which essentially moves the DFE to the transmitter, but 
is inapplicable if the transmitter has no knowledge of the 
channel. 

The approach proposed in this paper allows to avoid error 
propagation without incorporating an interleaver and with 
CSI available at the receiver only. In essence, the proposed 
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method enables to perform (soft or hard) block decoding 
before decision feedback is performed. 

We build on the integer-forcing (IF) equalization approach, 
which was recently proposed ID in the context of general 
MIMO channels with CSI available at receiver only. In this 
approach, multiple streams are encoded using an identical 
linear code and the receiver equalizes the channel matrix to 
any full-rank integer matrix rather than to the identity. 

In the context of IF equalization, an ISI channel may be 
viewed as a Toeplitz matrix. This special structure allows to 
further replace the multiple codewords (in the context of ISI, 
this number would be large) with a single codeword provided 
that the linear code is further a cyclic cot/eQ 

The paper is organized as follows. Section HIl recalls some 
basic results on single-carrier equalization. Section |III] de- 
scribes IF equalization as well as the general structure of the 
proposed scheme. Section |IV] derives the criteria for choosing 
the FFE equalizer. Section [V] derives bounds on the attained 
performance. In Section [VTl the performance of the scheme is 
analyzed, and some examples are given. Section FVlIl discusses 
practical coding techniques for integer-forcing equalization 
at high transmission rates. The paper concludes with Sec- 
tion |Vlll] 

II. Preliminaries 

We briefly review basic single-carrier equalization archi- 
tectures. In the sequel, we use /^-transform notation for 
sequences; i.e., a sequence {sk} is represented by its D- 
transform S{D) = s^D^. For example the channel ^ 
may be expressed as, 

Y{D) = H{D)X(D) + N{D). 

The simplest criterion for Unear equalization (LE) is that of 
Zero-Forcing (ZF), where the ISI is completely canceled using 
an FFE only. This corresponds to taking the front end (hnear) 
filter to be 

AMD) = J^y 

resulting in the equalized channel response G{D) = 1. The 
induced noise enhancement can be large, especially when 
H{D) has zeros near the unit circle. A variant that takes into 
account both ISI and noise enhancement is the linear MMSE 
equalizer 



A 



MMSE-LE 



H*{D- 



H{D)H*{D- 



1/SNR' 



The MMSE-LE suffers from smaller (and in particular 
bounded) noise enhancement while allowing some residual 
ISI. The MMSE criterion is equivalent to maximizing the 
SINR at the sheer input [6l. 

Decision-feedback equalization (DEE) (see Figure [T]) is 
based on using previously detected symbols in order to cancel 
the induced ISI from the symbol entering the slicer. In this 
approach, if all previous data symbols are detected without 

'The crucial element needed is that it be a linear shift invariant code, not 
necessarily cyclic. 



error, then postcursor ISI can be removed. Specifically, the 
output of the FFE in Figure [T] is given by 

y'k ^ Xk * gk + Zk 

— 1 CXD 

-^k H" ^ ^ •^k — 7n9fn ^ ^ •^k — 7n9m ^k 
m— — oo m— 1 



= Xk+ ISIf 



where G{D) is the equivalent channel after the operation of 

the FFE. The DEE then subtracts ISI^ — J2m=i ^k~mgm 
from y^, where Xk are decisions on past transmitted symbols, 
giving rise (assuming correct past decisions) to the equivalent 
channel 



Vk 



Xk 



— OO 

E 

7n— — l 



Xk^ 



^g-m ^" Zk' 



MMSE-DFE is optimal, when using the optimal FFE, in 
the sense that the SINR at the slicer (assuming correct past 
decisions) is equal to the SNR of an AWGN channel having the 
same capacity as that of the ISI channel 0. Combining DFE 
with coding, however, is a non-trivial task. Since a decision on 
the value of the last symbol Xk must enter the feedback loop 
at every time instance, there is an intrinsic tension with the 
latency required for channel coding. Many approaches have 
been suggested in order to overcome this obstacle (see for 
example lfT0ll - lfT4ll ). but to the best of our knowledge none 
of them allow to exchange the order of decoding and ISI 
removal which is the aim of the present work. Doing so, 
directly addresses the basic problem of ensuring that reliable 
decisions enter the DFE loop. 

III. Combining Cyclic Codes With IF Equalization 

We begin this section by recalling IF equalization in the con- 
text of ISI channels. For a more general description, see ||9|- 
For purposes of simplicity of exposition, we limit ourselves 
in the sequel to real-valued channels (and transmission of real 
symbols). The extension of the proposed scheme to complex 
transmission is straightforward, and, unless stated otherwise, 
all results derived in this work hold for the complex case as 
well. Further, we first describe the ZF-IF approach which is 
suitable for the high SNR regime, and relegate the extension 
to MMSE-IF (which is suitable for any SNR) to the Appendix. 

An IF equalizer (depicted in Figure |2]), rather than at- 
tempting to cancel the ISI (as in linear ZF as well as ZF 
decision feedback), ensures that the ISI is restricted to take 
only integer values. Assuming the data symbols are taken 
from a constellation of size q consisting of the integers 
{0, 1, . . . , q — 1} (i.e., a PAM constellation), which we may 
identify as the ring Z^, the ISI will take only integer values 
provided that the channel impulse response is equalized to an 
impulse response I{D), such that all the coefficients of I{D) 
are integers. We restrict attention to responses 1(D) of finite 
length and further assume that I{D) is a moniqj polynomial 

-There is some loss of generality in this assumption. However, the optimal 
choice for I{D) (as we discuss in Section Hvl is almost always monic. When 
this isn't the case, the proposed approach is not very effective. The restriction 
that I{D) be monic can be removed with some modifications to the scheme. 
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Fig. 1. Decision-feedback equalization. 
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Fig. 2. A schematic description of an IF-DFE system. 



I{D) - I 



(with integer coefficients), i.e., 



k=0 

where io = 1. We denote the vector of coefficients of I{D) 

by, 

1 = [1 ii ... in-l] . 

Note that taking I{D) = 1 is a special case in which the IF 
equalizer reduces to a ZF-LE one. In most cases, choosing 
/(D) otherwise results in smaller noise enhancement. The 
criterion for determining the feed-forward filter for the IF 
equalizer will be given in the next section. We further note 
that IF equalization is closely related to partial response 
signaUng (see for example 1 15 1), and was previously suggested 
in |fT6l where a suboptimal algorithm for finding a good 
integer channel I{D) was also suggested. Another work of 
similar spirit is ifTTi where it is shown that in some cases 
the complexity of the maximum likelihood decoder that is 
matched both to the channel and the code can be reduced 
if the channel is integer-valued. 

The key feature of IF equalization exploited in this paper 
is that, as we observe below, if the transmitted data is taken 
from a cyclic code, then the output of the equalized channel 
is also a member of the codebook. 

Assume some monic integer-valued filter was chosen by the 
receiver (the considerations for choosing 1(D) are discussed 



in Section IIVI ). The FFE applied as the front-end filter is 
therefore, 

. (2) 

The output of the equalizer is 

Y'{D) = X{D)I{D) + Z{D), 

where Z{D) = N{D)I{D)/ H{D) is colored Gaussian noise. 
We further define the SNR to be, 

SNRzF-IF-DFE = 



27r J-TT 



The impulse response I{D) should be chosen so as to max- 
imize SNRzF-iF-DFE- The definition of SNR (i.e, which does 
not include the residual ISI as noise) is justified when using 
a cyclic code as described next. 

Definition 1: A linear block code C of length N over Zg is 
called cyclic, if for every codeword x G C, all cyclic shifts of 
X are also codewords in C. 

We denote cyclic convolution w.r.t. block length N by 0. 
The following proposition is an immediate consequence of 
Definition [1] 

Proposition 1: Let C be a cyclic code of length N over Z^. 
Then for any vector i of length N with integer entries, 

C(g)iCC. 
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That is, C is closed under integer-valued cyclic convolution 
over Zq. 

We conclude that if the linear convolution Xk*ik over K were 
a cyclic convolution Xk i8) ik with operations over Z^, then we 
would be able to decode x' = x (g) i directly from the output 
y', and then reconstruct x from it0 

The transformation to cyclic convolution may be accom- 
plished in much the same way as in DMT/OFDM transmission. 
In order to do so, we use a systematic cyclic encoder at 
the transmitter such that the last K symbols of a [N, K] 
(cyclic) block code are the data symbols and the first N — K 
symbols are redundancy symbols. We fix the last n — 1 
information symbols to be zeros, where n is the length of 
I{D). Consequently, all codewords end with n — 1 zeros. The 
effective rate is reduced to [K — n + i)/N. 

Due to the zero padding, if we take only the first N samples 
of the output of the FFE, reduced modulo q, we get, 

y'l. mod q — [{x^ i)k + Zk] mod q, for k — 1, . . . , N . (3) 

This process is illustrated in Figure |3] The output of the 
FFE is processed in blocks of length N (the blocks play no 
role in the operation of the FFE itself). The zero padding, 
in addition to initializing the DEE with zeros, as described 
below, ensures that when a new data block is transmitted, the 
"channel's memory" is empty, i.e., there is no ISI between 
consecutive blocks. 
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Fig. 3. An Illustration of transforming a linear convolution over R into 

a cyclic convolution xj. ® over Z7. In this example I{D) = 1 + 2D + . 



Since the result of the cyclic convolution is itself a code- 
word, we may apply the channel decoder directly to it. 
Assuming correct decoding, we next reconstruct Xk from the 
decoded result of the cyclic convolution. This can be done in a 
recursive way by applying a DFE as shown in Figure |20 Since 
we know the last n — 1 values of x^ (i.e., they are all zero), 
we are able to recover Xk for every k ~ 1, N — {n — 1). 

' It is important to note that in order to reconstruct x from x C3 i mod q, 
all elements of x must be in Zq. Nevertheless, the elements of i need only 
be integers but need not necessarily be restricted to Zq. 

''ah operations of the DFE are carried over Zq It is always possible to 
reconstruct xj. regardless of q (i.e. whether it is prime or not) and the choice 
of the integer-valued filter since I{D) is monic. 



The reconstruction of x^ from x^. is performed using 
a DFE rather than a linear equalizer, as the integer filter I{D) 
often has zeros on the unit circle, and hence it is not invertible. 

We end this section by remarking that the recently proposed 
"Signal Codes" ifTSl . though not cyclic, are also suitable for IF 
equalization. Signal codes are a special family of lattice codes 
which are generated by convolving sequences of data symbols 
taken from a PAM/QAM constellation with a monic filter 
F{D). For data sequences of length N and an encoding filter 
F{D) of length M, the generating matrix of the lattice code 
is an {N + M — 1) x N Toeplitz matrix. For a data sequence 
B{D) the corresponding codeword i^ X{D) = B{D)F{D). 
When IF equalization is used in conjunction with signal 
codes, the output of the equivalent channel after the FFE is 
(neglecting the additive noise) X'{D) = B{D)I{D)F{D). 
Let B'{D) = B{D)I{D), and note that B'{D) is a sequence 
of {N + n - I) PAM/QAM symbols as I{D) is an integer 
valued filter of length n. It follows that X'{D) is a codeword 
from a signal code generated by F{D) with a generating 
ToepUtz matrix of dimensions {N + M + n — 2) x {N + n— 1). 
For large enough N, this codebook has similar performance 
to those of the original codebook. Therefore B'{D) can be 
decoded and then B{D) can be reconstructed from it. Note 
that for signal codes there is no need to transform the linear 
convolution performed by the channel into a cyclic one, as 
done when cyclic codes are used. In ifTSl it is demonstrated 
that signal codes can operate reasonably close to capacity with 
an acceptable complexity, and may therefore be attractive for 
IF equalization at high SNR. 

IV. Criteria for Choosing The Filter I{D) 

We wish to find an integer-valued filter I{D) of length n 
such that the noise enhancement experienced by the ZF-IF 
equalizer for a given channel H{D) is minimized. The noise 
variance at the output of the IF front-end filter ^ is, 

_ 1 r |/(e^")p 

'''ZF-IF-DFE ~ 27r 



Denote, 



K{D) 



|i7(eJ^)|2 
1 



duj. 



(4) 



H{D)H*{D-*) 



(5) 



Thus, km is an autocorrelation sequence (and in particular 

k—m — krn). 

Using straightforward algebra, (|4]i may be written as a 
quadratic form, 

"'ZF-IF-DFE ~ 

where K„ is the positive semi-definite Toeplitz matrix. 



fco 

fci 



k-i 
ko 



k 



n— 1 



k 



n-2 



kn~3 ■ 



k_ 



'("-1) 
-(n-2) 



ko 



^In fact, some additional "shaping" steps must be taken in order to assure 
that the power constraint is preserved. The full details can be found in 1181 . 
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Let F„ be any matrix satisfying. 



where 



K — F F 



We therefore have. 



c^zF-iF-DFE = iK«i^ = iFn F„i^ = ||F„i^ 11^ (6) 

Equation ^ implies that finding the optimal (ZF) integer- 
valued filter I{D) is equivalent to finding the shortest vector 
in the lattice A(F„), which is composed of all integral com- 
binations of the columns of F„, i.e.. 



A(F„) = {A = F„i I i e Z"}. 



(7) 



Finding the shortest lattice vector is known to be NP hard, 
but fortunately efficient suboptimal algorithms for finding a 
short lattice basis are known. An important representative of 
this class of algorithms is the celebrated LLL algorithm |fT9l , 
which has polynomial complexity and usually gives adequate 
results in practice. In order to find a "good" integer-valued 
filter I{D), we may therefore apply the LLL algorithm on 
F„. The algorithm's result is a new basis (of short vectors) 
for F„. We then need to find the shortest vector v in this 
basis and choose i = F„^^v. In the next section we derive an 
upper bound on the induced noise enhancement when the true 
(optimal) shortest vector of the lattice is used, which serves 
as a useful benchmark. 

V. An Upper Bound On The Noise Enhancement of 
OPTIMAL ZF-IF Equalization 

We now upper bound the noise enhancement induced by 
optimal ZF-IF equalization. Throughout this Section we as- 
sume that the channel has a finite-length impulse response of 
length p + 1, i.e.. 



H{D) 



p 

E 

fc=0 



The length of the channel will play an important role in the 
derived bound. 

We proceed to upper bound ctIf-if dfe using known results 
from the theory of lattices, and the theory of Toeplitz matrices. 

Lemma 1: (Minkowski) Let Ai (A (F„)) denote the shortest 
vector in the n-dimensional lattice A (F„). If F„ is of rank n, 
then, 

2" 



Ai(A(F„))< j |det(F„)|-, 

where /?„ is the constant of proportionality of the volume of 
an n-dimensional sphere, i.e., the volume of an n-dimensional 
sphere with radius R is given by Vn{R) — PnR""- 



Proof: See, e.g., ||20|. 
The constant /3„ can be bounded by (see, e.g., ||2TI ). 



/27re 



Tl/2 



\ n J y/lAnn 
RecalUng that K„ — F„^F„, we therefore have, 

A? (A (F„)) < r7(n) • [det(K„)] " , (8) 



2n : 
mn) = — • (1.47rn)' 
7re 



(9) 



Therefore, in order to obtain a closed-form bound on 
Af(A(F„)), it suffices to evaluate dct(K„). To that end, 
define. 



an 



\zoZi 



|2p 



1 



(10) 



where Zq, Zi, . . . , Zp^i are the maximum-phase zeros of 
H{D)H*{D^*) (i.e., the zeros outside the unit circle). 

We are now ready to present Theorem [T] which is the main 
result of this section. The theorem makes use of a result from 
the theory of Toeplitz matrices which we state first, and the 
proof of which can be found in |22, Chapter 5]. 

Lemma 2: Let H{D) — TTk=o ^kD^ be a polynomial of 
degree p. Further, let K{D) be as defined in (|5]l. Then for 
every n>p + l. 



[det(K„)] " = (a//)" • exp 
where an is defined in ( fTOl i. 



1 

2^ 



logX(e^")da; 



Theorem 1: Assume that the channel has a finite-length 
impulse response H{D) = Y%=o^kD'^ ■ For an optimal 
choice of the integer filter I{D), the noise power at the output 
of the filter I[D) /H{D) is bounded b>@ 



< f^ZF-DFE • min 

n>p-\-l 



ri{n) ■ [aH)'-' 



(11) 



where ri{n) is defined in (|9]l and au is defined in ( fTol i. 
Proof: Lemma |2] may be rewritten as 

[det(K„)] " = (a^f) " • a|F_DFE> (12) 



where 



'ZF-DFE 



exp 



1 

2^ 



\og\H{e^'^)\'^(hj 



(13) 



is the variance of the AWGN after the optimal FEE is applied 
in ZF-DFE (see, e.g., |l6l). Substituting (O into dSjl yields: 

Ai(A (F„))' <Tl{n) ■ {any ■ tIf-bfe- 

Since ctIfjf-dfe ~ theorem is proved. ■ 

The additional noise enhancement caused by the ZF-IF 
equalizer w.rt. an optimal ZF-DFE as bounded in (fTTI) consists 
of two factors: (a^f)" and ri{n). The factor (a//)" is greater 
(or equal) to 1 and tends to 1 as n — >^ cxd. Thus, allowing 
for a long filter I{D) mitigates the effect of this factor On 
the other hand, the factor ri{n) is (approximately) linearly 
increasing with n. The minimization in ( fTTT i strikes a balance 
(i.e., searches for the optimal tradeoff) between these two 
factors. 

'For (complex) transmission over a complex channel, (9) changes to: 

4n , ,1 
rj{n) = — ■ (2.87rn)2^r . 
7re 
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The tightness of the bound depends on the tightness of the 
Minkowski bound of Lemma [T] It is known that there exist 
lattices that satisfy this bound with equality, but for "most" 
lattices the shortest lattice vector is much shorter than what 
this bound predicts. We further note that the family of lattices 
considered in this paper is not general, as F^F„ is a Toeplitz 
matrix, and therefore the Minkowski bound may never be tight, 
as we further discuss in the next section. 

VI. Performance of ZF-IF Equalization 

The capacity of the Gaussian ISI channel ^ at high SNR 
is given by (see e.g. f6\) 



C=bog, (4^(1 + 0(1))), 

^ \ "^ZF-DFE 



(14) 



where cr^p.^FE defined in ( fT3] ). and where o(l) — > as 
SNR oo. Note that (fT4l l is valid only for channels for which 
(t|p jjpg is finite, i.e. for channels satisfying the Paley-Wiener 
condition (see |I2)- 

In this section we analyze the total gap-to-capacity of 
the ZF-IF equalization scheme at high SNR, i.e. the gap 
between the performance obtained using ZF-IF and the optimal 
performance ( fT4] i. 

The ZF-IF scheme, as described in Section Hill transforms 
the original Gaussian ISI channel into an equivalent modulo- 
additive channel 

Y'{D) = \X{D)I{D) + Z{D)] mod q 
= [X'{D) + Z{D)] mod q, 

where Z{D) is filtered Gaussian noise. As we recall, correct 
decoding of X'{D) ensures correct reconstruction of X{D). 
In practice, the transmitted signal Xk is subject to the power 
constraint 

E [xl] < SNR, 

and hence, instead of mapping the codewords from the cyclic 
code C to the constellation Zq, we map them to the constel- 
lation 



cVSNR 



-{q-1) -iq-3) 



1 



(15) 



2q ' 2(7 ' ' 2q 

The constant c is chosen such that the power constraint is 
satisfied with equality for a uniform distribution over the 
constellation. It can be easily verified that c > -\/l2 for any 
value of q and approaches \/T2 for large values of 

Let A = cvSNR. Using the constellation ( fTsT i, the equiv- 
alent modulo-additive channel after the FFE can be rewritten 



Y'{D) = [X'{D) + Z{D)] mod [-A/2, A/2) 



(16) 



In order to analyze the performance limits of the equivalent 
channel, we lower bound the mutual information I{X';Y') 
between its input and output. The mutual information cor- 
responding to a certain distribution on the channel's input 

'The codebook C is invariant to the convolution with I{D) as defined in 
Section Hn] and hence X'(D) is uniformly distributed over the constella- 
tion (15), as is X(D). 



gives the highest possible rate for reliable communication, 
when a random channel code which is drawn according to that 
distribution is used [23 1. However, the equivalent channel (fT6l l 
is attained using a linear cyclic code (as opposed to a random 
code), and thus the mutual information may never be achieved. 
Nevertheless, the mutual information is a useful metric for the 
performance of ZF-IF if we account for the possible loss of 
rate due to the use of a linear cyclic code. 

The channel ( fT6] l is a modulo-additive colored Gaussian 
noise channel, where the variance of the noise is cr|p4p_Qp£. 
The mutual information between the input and output of such 
a channel is maximized by a uniform memoryless distribution 
over the modulo interval. Moreover, for such an input distri- 
bution, the mutual information is lower bounded by that of 
a modulo-additive channel with white Gaussian noise having 
the same variance, see ll24l . Let Z he a Gaussian random 



variable with zero mean and variance cr|p [p ]-,pp. If X' has 
a memoryless uniform distribution over [—A/2, A/2), which 
corresponds to taking q to infinity, Y' has a memoryless 
uniform distribution over the interval as well, and we have 

I{X'; Y') > log2(A) - h{Z mod [-A/2, A/2)) 

>log^{A)-hiZ) (17) 

= 2 log2(^^) - 2 log2(27i-e • ctIf-if-dfe) 

1 , / SNR 12 \ 
= 9 log2 — ^ ' 

^ \ '^ZF4F-DFE ^"''2 / 

where (fTTb follows from the fact that modulo reduction can 
only decrease differential entropy. Let us express cr|p_[p_j-,pg as 
a product of two factors 

2 _ 2 
'^ZF-IF-DFE — ll,H ■ fzF-DFE; 

where 7/,h is the additional noise enhancement caused by the 
FFE in ZF-IF w.r.t. that caused by the FFE of ZF-DFE. With 
this notation we have 

1 1 f SNR 1 12 

I{X';Y')>-logJ- — 

2 Vc^zF-DFE 7/,-ff Svre 



= 1 logs 



j^SNR/r,,H 



^ZF-DFE 



where 



2^e 
^'^ ^ T2~ ' 



is the SNR loss due to IF-ZF equalization. It follows that the 
gap-to-capacity in dB is given bjH 



ioiogio(r/,H) - 10 log 



/27re 



101ogio(7/,H)- (18) 



The first term on the r.h.s. of ( fTST i is the well-known high- 
SNR shaping gain, which equals 1.53dB. The reason for this 
loss is the one-dimensional modulo operation we use at the 
receiver. The modulo operation allows the decoder to use 
the original codebook C for decoding x' which preserves 
the original decoding complexity of the codebook. However, 
decoding the equivalent channel's output after the modulo 

*This is the gap-to-capacity in the case of (complex) transmission over 
complex channels as well. 
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reduction is equivalent to searching for the point that was 
most likely transmitted over the infinite lattice C + AZ^. This 
is strictly suboptimal since no more than points of the 
infinite lattice, which correspond to C ® i (with operations 
carried over the reals), are valid, and hence better performance 
can be achieved by searching only over these valid points at 
the decoder. 

While the modulo loss amounts to 1.53dB at high-SNR, 
at low SNR the loss may be significantly larger (see, e.g., 
||24l), which makes IF equalization less attractive in that 
regime. This loss can be mitigated by incorporating shaping 
at the transmitter, and/or avoiding the modulo reduction at the 
receiver. Implementing such modifications at the transmitter 
and receiver without significantly increasing the computational 
complexity is an interesting avenue for future research. 

The second loss in (fTSl l is related to the additional noise 
enhancement caused by the FFE. It follows from Theorem [T] 
that for the optimal choice of the integer-valued filter, is 
upper bounded by 



7ff = min ji h ^ min 

I{D) ' n>p+l 



r]{n) ■ (an) 



(19) 



In order to gain more insight regarding the term 7/ we 
illustrate the effect of the noise enhancement through the 
following examples of ISI channels. 

Example 1 - Two-tap real "RAKE" channel: An inter- 
esting example is a real channel with only two non-zero taps 
(which may be arbitrarily far apart), namely H{D) = 1+aDP. 
Without loss of generality, we can assume that \a\ < 1, 
as otherwise we can transform the channel into the channel 
H{D) = 1 + by using an all-pass filter. It is rather 
obvious that the optimal choice for I{D) is either Iq{D) = 1 
or Ii{D) = 1 + -^\DP. It can be shown by straightforward 
algebra that for the choice Io{D), the noise enhancement is 
given by 1/(1 — a^), and for the choice Ii{D) the noise 
enhancement is given by 2/(1 + \a\). It follows that for 
\a\ < 1/2, Io{D) is better, while for 1/2 < \a\ < 1, Ii{D) is 
better, and the maximum noise enhancement (which occurs 
for \a\ = 1/2) is 4/3 « 1.25dB. Since for this channel 
'''zF-DFE ~ 1' '^i,H is equal to the noise enhancement. Figure|4] 
depicts 7/ for the real two-tap channel with an optimal 
choice of I{D), along with the bound ( fT9] ) for the same 
channel, and the noise enhancement caused by a ZF-LE. It 
is evident from the figure that for this channel the bound is 
not tight. 

Example 2 - Two-tap complex "RAKE" channel: Con- 
sider the channel from the previous example H{D) ~ l+aZJP, 
where now a — re^^ is a complex number. We assume without 
loss of generality that \a\ < 1. In contrast to the real-valued 
two-tap channel, in this case there is no clear choice of I{D) 
for each value of a. Nevertheless, similar to the real two-tap 
channel, 7// is independent of the delay p between the first 
and the second tap (as is also the case for the ZF-LE). This 
follows since the noise enhancement caused by each of the 
filters Ai{D) = I{D)/H{D) and A2{D) = I{DP)/H{DP) 
is the same. Therefore, if for the channel H{D) = 1 + aD a 
certain choice of I{D) results in noise enhancement of ^/i^h. 



■ ZF-LE 
- Bound on 



1(D) = 1 



I{D) = 1+D 




Fig. 4. 7/.H for the real two-tap channel H{D) = l + aDP. The solid line 
shows "fi^H for the optimal choice of I{D) for any p, the dashed line shows 
the bound (19) for p=l and the dotted line shows the noise enhancement 
induced by ZF-LE 



for the channel H{D) = 1 + aDP, the choice I{D) = I{Dp) 
results in 7/ as well. While the performance of a ZF-LE 
is independent of the phase 9, the noise enhancement caused 
by the ZF-IF equalizer is significantly influenced by 9. It is 
interesting to note, however, that for the two-tap channel, the 
bound iT% (in its complex form) is independent of 9. Figure |5] 
depicts jj^H (in dB) for the two-tap channel with an optimal 
choice of I{D) (which was found numerically), for values of 
\a\ < 0.99. 
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Fig. 5. 7/,jf (in dB) for the complex two-tap channel H{D) = 1 + (x - 
j ■ y)D^ , when the optimal integer-valued filter I{D) is used. 



Example 3 - An ISI channel with random taps: In this 



g 



example we consider the channel 



k=0 

where {hk}l.^Q are real i.i.d. random Gaussian variables with 
zero mean and unit variance. We numerically evaluate the 
probability density function (pdf) of jj^h in dB for the optimal 
choice of I{D). Figure |6] depicts the results for p = 3, p — 5 
and p — 7. The results show that becomes larger when the 
channel is longer, which is also the behavior of the bound iT% . 




Fig. 6. The pdf of 7/,h (in dB) for the ISI channel H(D) = Efe^o hkDP 
where {/ifc}^_Q are i.i.d. random Gaussian variables with zero mean and unit 
variance, when the optimal integer-valued filter I{D) is used. 



A third loss incurred by ZF-IF equalization is related to the 
constraint that the channel code is cyclic. The gap-to-capacity 
of ZF-IF equalization at a certain block error probability, is 
the sum of the shaping loss, the additional noise enhancement 
7/,H (in dB) and the gap-to-capacity (in dB) of the chosen 
cyclic code at the desired block error probability. To the best 
of the authors knowledge, it is still an open question whether 
cyclic codes can attain the capacity of the AWGN channel, 
let alone, that of the mod-A AWGN channel. Nevertheless, 
algebraic cyclic codes (such as BCH for example) have 
been extensively used in communication systems for decades 
due to the fair tradeoff they offer between performance and 
complexity. Modern (e.g., LDPC) cyclic codes are a subject 
of extensive research nowadays, and some families of such 
codes were reported to have very good performances over the 
AWGN channel! see e.g. 123, ||26|. 

Most of the research effort on linear cyclic codes have been 
devoted to binary codes. In the high-SNR regime, which is 
the focus of this paper, the cardinality of the code must be 
greater than two, and a binary code does not suffice. In the 
next section we propose a practical coded modulation scheme 

'Note that in ZF-IF the additive noise after the FFE is not white, and thus 
it is important that the cyclic code should not be sensitive to the memory of 
the noise. 



for high transmission rates that utilizes binary cyclic codes 
and is suitable for cyclic-coded integer forcing equalization. 

VII. Practical coding at high rates 

In order to achieve high transmission rates, using a binary 
channel codebook, we transmit a combination of coded and 
uncoded bits which are mapped to a 2^^-PAM constellation 
points by natural labeling. Namely, we transmit^ 



M-l 



J2 

6=1 



(20) 



where Xc is a codeword of length N from the binary linear 
cyclic code C, and x„^, b = 1, . . . , M — 1, are blocks of 
uncoded information bits having the same length (N) as that 
of Xc. Our goal is to detect (x (g) i) mod 2^^ without error, and 
apply the DFE (with all operations carried over ) in order 
to reconstruct xFI 

We now show that natural labeling retains the closure 
property under integer-valued cyclic convolution. 

Lemma 3: Let A denote the set of vectors x given by 
such that Xc G C. Specifically, 



A = {x|xceC,x„, eZf, b^l,...,M-l} . 

If X e A, then (x (g) i) mod 2*^ e A. 

Proof: Using natural labeling, it is clear that the result of 
the cyclic convolution reduced modulo 2*^ can be written as, 

(x (8) i) mod 2*^ = xq + ^ Xh2^ 

fc=i 

for some vectors Xg, Xi, . . . , xm-i G '^2 ■ Clearly if xq G C 
then (x (g) i) mod 2^^ G A. We have, 

xo = ((x® i) mod 2^^) mod 2 

= (x (g) i) mod 2 

' M-l \ \ 

(g) i mod 2 




/ M-l \ 

Xc (g i + ^ 2'' (x„, (E) i) 

V b=l J 



mod 2 



(a) 



(Xc 



{b) 

i) mod 2 e C, 



where (a) holds since (x„j, g) i) G Z^, and (&) holds because 
C is a binary linear cyclic codebook. Thus, xq is indeed a 
codeword. ■ 
At the equalizer's output we get a corrupted version of x(gi, 
and after reducing modulo 2, we get xo (plus folded Gaussian 
noise). Since xq G C, it can be decoded before preceding to 
detect the uncoded bits. A complete knowledge of xq divides 
the constellation into two cosets, which makes the distinction 
between two points from the same coset easier than the 

'"Due to the power constraint, the transmitted signal, in effect, would be 
given by {B) with g = 2*^. 

' ' Note that in order to reconstruct x we need to know the value of its last 
L samples. We therefore need to zero-pad the last L uncoded bits as well as 
the last L coded bits. 
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distinction between two points of the full constellation, thus 
doubling the Euclidean minimum distance in the constellation 
as in Ungerboeck's set partitioning (see, e.g., 11]). We can 
therefore first decode Xq and then detect the uncoded bits using 
a sheer with double step size. For more details, see ||27| . ||28| . 

VIII. Discussion and Conclusions 

We have presented a novel DFE scheme for the discrete- 
time linear Gaussian channel, suitable for single-carrier trans- 
mission where channel state information is not available at 
the transmitter. The scheme enables block decoding to be 
performed before applying the DFE when a cyclic code is 
used. The channel is equalized to an impulse response that 
is comprised of integer coefficients only. The performance of 
the proposed scheme was analyzed, and in particular an upper 
bound on the induced noise enhancement was derived. Several 
examples of ISI channels were examined and it was shown 
that in many scenarios the scheme achieves a rather small 
gap-to-capacity. An interesting avenue for further research is 
to incorporate shaping into the scheme in order to make the 
scheme attractive at low SNR. Another interesting question 
to be explored, is how IF equalization can be combined 
with iterative equalization methods such as turbo equalization. 
Since the noise enhancement the FFE in IF equalization causes 
is always smaller (or equal) to that of an ZF-LE, combining 
it with an iterative equalizer may result in better convergence. 
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Appendix 
The MMSE-IF Scheme 

It is well known that the MMSE-DFE is strictly better 
than the ZF-DFE. The differences between the two are more 
pronounced at low SNR, and vanish as the SNR increases (for 
channels that satisfy the Paley-Wiener condition). We now 
develop an MMSE version for IF as well, which is strictly 
better than ZF-IF. Nevertheless, at high SNR the schemes have 
similar performance. 

Our goal in designing the MMSE-IF equalizer is to produce 
an output, 

Xk *ik + ek = Xk + efc, 
that maximizes the SINR, which we define as: 



SINRmMSE-IF DFE 



E 



El 



Note that in fact we are interested in estimating x'^. = Xk ® ik- 
However, since we transform the linear convolution performed 
by the channel into a cyclic one, simply by zero padding the 
transmitted block and ignoring the last entries of the output 
of the FFE (as described in Section HIIll. there is no difference 
between estimating Xk and xj,. 



The output of the FFE, Vk, serves as an estimate for Xk 
from the sequence {yk}, and can be written as. 



V{D) = A{D)Y[D) 

= A{D)H{D)X{D) + A{D)N{D) 
( A{D)H{D) 

\ m 

X{D) + {G{D) - 1) X{D) + Z{D), 



I{D)X{D) 



1 I{D)X{D) + Z{D) 



where Z{D) is the filtered Gaussian noise, and G{D) is some 
linear time-invariant filter. Since we want our estimator to be 
unbiased (see \6l), we require that G{D) be monic (i.e, qq = 
1). In the time domain we have. 



Vk = Xk + 



^giXk-i + Zk 



Xk + e/c- 



(21) 



The derivation of the optimal MMSE-IF filter is reminiscent 
of finding the optimal MMSE-FFE in the classic DFE scheme 
||6l. However, there is one difference. In the classic DFE 
scheme the objective is to estimate the data symbols which are 
uncorrected. Thus, the estimated symbol Xk and the residual 
noise (which is composed of data symbols from different 
time instances and Gaussian noise) are also uncorrected. 
In the MMSE-IF scheme, we estimate an (integral) linear 
combination of the data symbols. For this reason, different 
samples of Xk are correlated and therefore Xk and Ck are 
correlated as well. We would like to be able to follow the 
derivation of the optimal MMSE-FFE JS). To this end, we 
may remove the correlation between Xk and efc by dithering 
as we show in Appendix lAl 

As shown in 1 6 1, if the output of an equalizer is composed of 
signal and noise which are uncorrected, maximization of the 
SINR is equivalent to minimization of the MSE. This imphes 
that the optimal choice for the front-end linear filter is given 
by the Wiener filter. 



A{D) = bo 



I{D)H*{D- 



H{D)H{D~*) + 



(22) 



The role of the factor bo in (|22] | is to guarantee that the 
estimator is unbiased, i.e., to guarantee that go = 1. Let 
See{e^'^) be the Power Spectral Density (PSD) of the residual 
noise (ignoring the bias-removal term bo). The filter I{D) 
should be chosen such that. 



\I{en? 



1 



-duj 



27r |i7(eJ'")|2 - 

is minimized. Following the derivation of Section IIVI we 
define. 



K 



MMSE.i 



fco 

k 



ko fc-1 . 

k„-2 kn^?. 



ko 
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Fig. 7. A schematic description of a MMSE-IF system. The notation modA denotes reducing modulo the interval [—A/2, A/2) 



where 



k — 



1 



1 



2n \H{e^^)\ 



2 _L 1 



is the autocorrelation sequence corresponding to, 

1 



K{D) = 



H{D)H*{D-*) 



Let pMMSE.ri be a matrix satisfying, 

KMMSE,ri — FlVlMSE.riFMMSE.n- 

The optimal choice of I{D) is given by finding the short- 
est vector in the lattice Fmmse.h, which can in practice be 
approximated by using the LLL algorithm as was explained 
in Section |IV] Note that the choice of I[D) is now SNR 
dependant. 



A. Removing Correlation by Dithering 

In order to remove the correlation between Xk and Cfe in 
(l2n i. we use a pseudo-random dither dk which is uniformly 
distributed over the modulo interval [—A/2, A/2), and is 
assumed to be known to both the transmitter and the receiver. 
As we recall, in the case of g-PAM transmission, i.e., the data 
symbols are 



Xk 



e cVSNR 



-{q-l) -iq-3) 



2q 



2q 



q-l 

2q 



the modulo interval is simply 
The transmitted sequence is 



=%/SNR/2, +cVSNR/2 



Xk = {xk + dk) mod [-A/2, A/2), 

which is uniformly distributed over [—A/2, A/2), and sta- 
tistically independent of Xk- Nevertheless, we stiU seek an 
estimator for Xk = Xk * ik- 
Define Xk as, 

Xk — Xk * Ik- 

The receiver's front-end filter is the optimal linear MMSE 
estimator for Xk from the sequence {yk}, given by. 



A{D) = bo 



I{D)H*{D-*) 
H{D)H{D-*) + ^- 



where 6o is a bias-removal term, serving the same purpose as 
6o in (|22] |. The receiver then computes, 

y'k = {vk - dk * ik) mod [-A/2, A/2) 

^fc + X! 9iXk^i + Zk - dk*ik\ mod [-A/2, A/2) 

= ^{xk*ik) mod [- A/2, A/2) 
+ {dk * ik) mod [-A/2, A/2) -~ dk * ik 
+ m^k^i + 2fc ) mod [-A/2, A/2) 

Xk^Y^ 9ixk~i + Zk mod [- A/2, A/2) 

1^0 ) 

= (ife + efc) mod [- A/2, A/2) 

where Xk and are uncorrelated. The complete system is 
illustrated in Figure |7] 
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